Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
Abstract
Almost all speech contains simultaneous contributions from more than one acoustic source within the speaker’s vocal tract. In this paper, we propose a method, the pitch-scaled harmonic filter (PSHF), which aims to separate the voiced and turbulence-noise components of the speech signal during phonation, based on a maximum likelihood approach. The PSHF outputs periodic and aperiodic components that are estimates of the respective contributions of the different types of acoustic source. It produces four reconstructed time-series signals by decomposing the original speech signal, first according to the amplitude and then according to the power of the Fourier coefficients. Thus, one pair of periodic and aperiodic signals is optimized for subsequent time-series analysis, and another pair for spectral analysis. The performance of the PSHF algorithm was tested on synthetic signals, using three forms of disturbance (jitter, shimmer and additive noise), and the results were used to predict the performance on real speech. Processing recorded speech examples elicited latent features from the signals, demonstrating the PSHF’s potential for analysis of mixed-source speech.
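The idea of splitting a frame into periodic and aperiodic parts through its Fourier coefficients can be illustrated with a toy sketch. This is not the PSHF as published. It assumes that the analysis frame spans exactly an integer number of pitch periods (the hypothetical parameter `periods_per_frame`), so that harmonics of the fundamental fall on DFT bins that are multiples of that number. It uses a rectangular window and covers only an amplitude-style split; the power-based pair of signals and the maximum likelihood formulation mentioned in the abstract are not modeled. The function name and all signal parameters are illustrative assumptions.

```python
import numpy as np

def harmonic_split(frame, periods_per_frame):
    """Toy split of a frame into harmonic-bin and remaining-bin parts.

    Assumption: len(frame) covers exactly `periods_per_frame` pitch periods,
    so harmonics land on DFT bins that are multiples of `periods_per_frame`
    (bin 0, the DC term, is kept with the periodic part).
    """
    spectrum = np.fft.rfft(frame)
    bins = np.arange(spectrum.size)
    harmonic_mask = (bins % periods_per_frame) == 0
    # Keep only the harmonic bins and transform back to the time domain.
    periodic = np.fft.irfft(np.where(harmonic_mask, spectrum, 0), n=len(frame))
    # Everything else is attributed to the aperiodic estimate.
    aperiodic = frame - periodic
    return periodic, aperiodic

# Synthetic test signal (hypothetical values): two harmonics plus white noise.
rng = np.random.default_rng(0)
period = 80                      # samples per pitch period
b = 4                            # pitch periods per analysis frame
n = np.arange(b * period)
voiced = np.sin(2 * np.pi * n / period) + 0.5 * np.sin(2 * np.pi * 3 * n / period)
noise = 0.1 * rng.standard_normal(n.size)

periodic, aperiodic = harmonic_split(voiced + noise, b)
clean_periodic, clean_aperiodic = harmonic_split(voiced, b)
```

In this sketch the noise that happens to fall on the harmonic bins stays in the periodic estimate, which is one reason a practical method needs further compensation beyond simple bin selection.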
Similar resources
Speech analysis by subspace methods of spectral line estimation
Over frames of short time duration, filtered speech may be described as a finite linear combination of sinusoidal components. In the case of a frame of voiced speech the frequencies are considered to be harmonics of a fundamental frequency. It can be assumed further that the speech samples are observed in additive white noise of zero mean, resulting in a standard signal-plus-noise model. This m...
Decomposition of Speech into Voiced and Unvoiced Components Based on a Kalman Filterbank
We present a novel method for decomposing speech into signals representing the voiced and unvoiced components of speech. The method involves first demodulating the variations in spectral envelope, energy and pitch, and then applying a bank of Kalman filters to separate the harmonic and non-harmonic components of the signal. The use of Kalman filters relies on a state-space representation of the...
Low-Complexity Pitch Estimation Based on Phase Differences Between Low-Resolution Spectra
Detection of voiced speech and estimation of the pitch frequency are important tasks for many speech processing algorithms. Pitch information can be used, e.g., to reconstruct voiced speech corrupted by noise. In automotive environments, driving noise especially affects voiced speech portions in the lower frequencies. Pitch estimation is therefore important, e.g., for in-car-communication syste...
HMM-Based Speech Enhancement Using Pitch Period Information in Voiced Speech Segments
An extension of the HMM-based speech enhancement approach [1] is presented. The HMM-based scheme uses hidden Markov models (HMM) to control a state-dependent Wiener filter, which is used to process the noisy speech signal. This scheme gives enhanced speech signals without the annoying tonal artefacts (‘musical noise’) of the spectral subtraction approach. However, parts of the enhanced signal o...
New Time-Frequency Domain Pitch Estimation Methods for Speech Signals under Low Levels of SNR
New Time-Frequency Domain Pitch Estimation Methods for Speech Signals under Low Levels of SNR Celia Shahnaz, Ph.D. Concordia University, 2009 Pitch estimation of speech signals is the key to understanding most acoustical phenomena as well as accurately designing many practical systems in speech communication. It is to determine the fundamental frequency or period of a vocal cord vibration causi...
Journal:
- IEEE Trans. Speech and Audio Processing
Volume 9
Publication date: 2001